Some Concept of Dispersion Measure for Categorical Data.
نویسندگان
چکیده
منابع مشابه
DISC: Data-Intensive Similarity Measure for Categorical Data
The concept of similarity is fundamentally important in almost every scientific field. Clustering, distance-based outlier detection, classification, regression and search are major data mining techniques which compute the similarities between instances and hence the choice of a particular similarity measure can turn out to be a major cause of success or failure of the algorithm. The notion of s...
متن کاملAn association-based dissimilarity measure for categorical data
In this paper, we propose a novel method to measure the dissimilarity of categorical data. The key idea is to consider the dissimilarity between two categorical values of an attribute as a combination of dissimilarities between the conditional probability distributions of other attributes given these two values. Experiments with real data show that our dissimilarity estimation method improves t...
متن کاملTransiogram: A spatial relationship measure for categorical data
Categorical geographical variables are normally classified into multinomial classes which are mutually exclusive and visualized as area-class maps. Typical categorical variables such as soil types and land cover classes are multinomial and exhibit complex interclass relationships. Interclass relationships may include three situations: cross-correlation (i.e. interdependency), neighbouring situa...
متن کاملSurvey on Clustering Algorithm and Similarity Measure for Categorical Data
Learning is the process of generating useful information from a huge volume of data. Learning can be either supervised learning (e.g. classification) or unsupervised learning (e.g. Clustering) Clustering is the process of grouping a set of physical objects into classes of similar object. Objects in real world consist of both numerical and categorical data. Categorical data are not analyzed as n...
متن کاملClustering Categorical Data Using an Extended Modularity Measure
Newman and Girvan [12] recently proposed an objective function for graph clustering called the Modularity function which allows automatic selection of the number of clusters. Empirically, higher values of the Modularity function have been shown to correlate well with good graph clustering. In this paper we propose an extended Modularity measure for categorical data clustering; first, we establi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Japanese journal of applied statistics
سال: 1998
ISSN: 0285-0370,1883-8081
DOI: 10.5023/jappstat.27.83